RAW SEQUENCE LISTING 
ERROR REPORT 



The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 



Application Serial Number: 
Source: 

Date Processed by STIC: 




THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703*308-4216. 
PATENTIN 2.1 e-mail help: patin21help@uspto.2Qv or phone 703-306-4119 (R. Wax) 
PATENTIN 3.0 e-mail help: patin3help@uspto.gov or phone 703-306-41 19 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.0 PROGRAM. ACCESSIBLE THROUGH THE U S PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW: 



Checker Version 3.0 

The Checker Version 3 .0 application is a state-of the-art Windows based software program 
employing a logical and intuitive user-interface to check whether a sequence listing is in 
compliance with format and content rules. Checker Version 3.0 works for sequence listings 
generated for the original version of 37 CFR §§1.821 - 1.825 effective October 1, 1990 (old 
rules) and the revised version (new rules) effective July 1, 1998 as well as World Intellectual 
Property Organization (WIPO) Standard ST.25. 

Checker Version 3 .0 replaces the previous DOS-based version of Checker, and is Y2K- 
compliant. Checker allows public users to check sequence listings in Computer Readable form 
(CRF) before submitting them to the United States Patent and Trademark Office (USPTO). 
Use of Checker prior to filing the sequence listing is expected to result in fewer errored sequence 
listings, thus saving time and money. 

Checker Version 3.0 can be down loaded from the USPTO website at the following address: 

http://www.uspto.gov/web/offices/pac/checker 
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OIPE 



RAW SEQUENCE LISTING 

PATENT APPLICATION: 



US/09/876, 796 



DATE: 
TIME : 



06/27/2001 
15:33:37 



Input Set : A:\SEQLIST.txt 

Output Set: N:\CRF3\06272001\I876796. 



raw 



Does Not Comply 
Corrected Diskette Needed 



c--> 
c--> 



C--> 3 

4 
5 
6 

Activity . 
7 

C--> 8 

9 
10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 
23 
24 
25 
26 
27 
28 
29 
30 
31 
32 



SEQUENCE LISTING 
(1) GENERAL INFORMATION: 

(i) APPLICANT: Horwath, K. L.,et al. 
(ii) TITLE OF INVENTION: Nucleic Acids Sequences Encoding Type III 

Tenebrio Antifreeze Proteins and Method for Assaying 



-> 
-> 



(iii) NUMBER OF SEQUENCES: 48 
(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Dr. Kathleen L. Horwath 

(B) STREET: Department of Biological Sciences , 

(C) CITY: Binghamton 

(D) STATE: New York 
(F) ZIP: 13902-6000 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette, 3.50 inch, 1 . 4 Mb storage 

(B) COMPUTER: IBM AT/ATX compatible 

(C) OPERATING SYSTEM: Windows 95/98 

(D) SOFTWARE: Microsoft Word 
(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US/09/876, 796 

(B) FILING DATE: 07-Jun-2001 

(C) CLASSIFICATION: 
(Vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 60210446 

(B) FILING DATE: June 8, 2000 
(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Mark Levy, Attorney-at-Law 

(B) REGISTRATION NUMBER: 29,188 

(C) REFERENCE/DOCKET NUMBER: RB125 
(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 607-722-660 

(B) TELEFAX: 607-724-2207 



Binghamton University 



ERRORED SEQUENCES 

C--> 60 (2) INFORMATION FOR SEQ ID NO: 2 

61 (i) SEQUENCE CHARAC TERISTICS : 

62 (A) LENGTH: /56CTbase pairs- 

63 (B) TYPE: nucleic acid 

64 (C) STRANDEDNESS : double 

65 (D) TOPOLOGY: linear 

66 (ii) MOLECULE TYPE: cDNA to mRNA 

67 (iii) HYPOTHETICAL: no 

68 (iv) ANTI-SENSE: no 

69 (vi) ORIGINAL SOURCE: 

70 (A) ORGANISM: Tenebrio molitor 
C--> 71 (C) INDIVIDUAL ISOLATE: none 
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RAW SEQUENCE LISTING DATE: 06/27/2001 

PATENT APPLICATION: US/09/876,796 TIME: 15:33:37 

Input Set : A:\SEQLIST* txt 

Output Set: N:\CRF3\06272001\I876796.raw 

C--> 72 (G) CELL TYPE: fat body and whole organism 

73 (vii) IMMEDIATE SOURCE: 

74 (A) LIBRARY: cDNA 

75 (B) CLONE: 13.17 

76 4 ^1) FEATURES 
78 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



E--> 


80 




GTGGATCCAA AGAATTCGGC 


ACGAGACTAC TAAG 


ATG 


AAG 


TTG 


CTC 




81 






















Met 


Lys 


Leu 


Leu 




82 




























-15 


E--> 


84 


TGT 


TGT 


CTA 


ATC 


TCC CTC 


ATT 


CTG 


TTG 


GTC 


ACA 


GTT 


CAG 


GCC 


CTG 




85 


Cys 


Cys 


Leu 


He 


Ser Leu 


He 


Leu 


Leu 


Val 


Thr 


Val 


Gin 


Ala 


Leu 




86 










-10 








-5 










1 


E--> 


88 


ACC 


GAG 


GCA 


CAA 


ATT GAG 


AAA 


CTG 


AAC 


AAG 


ATC 


AGC 


AAA 


AAA 


TGT 




89 


Thr 


Glu 


Ala 


Gin 


He Glu 


Lys 


Leu 


Asn 


Lys 


He 


Ser 


Lys 


Lys 


Cys 




90 








5 








10 










15 




E--> 


92 


CAA 


AAT 


GAA 


AGT 


GGA GTG 


TCG 


CAA 


GAG 


ATC 


ATA 


ACC 


AAA 


GCT 


CGC 




93 


Gin 


Asn 


Glu 


Ser 


Gly Val 


Ser 


Gin 


Glu 


He 


He 


Thr 


Lys 


Ala 


Arg 




94 








20 








25 










30 




E--> 


96 


AAC 


GGT 


GAC 


TGG 


GAG GAC 


GAT 


CCT 


AAA 


CTG 


AAA 


CGC 


CAA 


GTT 


TTT 




97 


Asn 


Gly 


Asp 


Trp 


Glu Asp 


Asp 


Pro 


Lys 


Leu 


Lys 


Arg 


Gin 


Val 


Phe 




98 








35 








40 










45 







126 



171 



216 



E--> 


100 


TGC 


GTG 


GCC 


AGG 


AAC 


GCC 


GGT 


CTG 


GCC 


ACG 


GAA 


TCG 


GGA 


GAG 


GTG 


261 




101 


Cys 


Val 


Ala 


Arg 


Asn 


Ala 


Gly 


Leu 


Ala 


Thr 


Glu 


Ser 


Gly Glu Val 






102 








50 










55 










60 






E--> 


104 


GTG 


GTC 


GAC 


GTG 


TTG 


AGG 


GAG 


AAG 


GTG 


AGG 


AAG 


GTC 


ACT 


GAC 


AAC 


306 




105 


Val 


Val 


Asp 


Val 


Leu 


Arg 


Glu 


Lys 


Val 


Arg 


Lys 


Val 


Thr 


Asp 


Asn 






106 








65 










70 










75 






E--> 


108 


GAC 


GAA 


GAA 


ACT 


GAG 


AAA 


ATC 


ATC 


AAT 


AAG 


TGC 


GCC 


GTC 


AAG 


AGA 


351 




109 


Asp 


Glu 


Glu 


Thr 


Glu 


Lys 


He 


He 


Asn 


Lys 


Cys 


Ala 


Val 


Lys 


Arg 






110 








80 










85 










90 






E--> 


112 


GAT 


ACT 


GTT 


GAA 


GAG 


ACG 


GTG 


TTC 


AAT 


ACT 


TTC 


AAA 


TGT 


GTC 


ATG 


396 




113 


Asp 


Thr 


Val 


Glu 


GlU 


Thr 


Val 


Phe 


Asn 


Thr 


Phe 


Lys 


Cys 


Val 


Met 






114 








95 










100 










105 






E--> 


116 


AAA 


AAC 


AAG 


CCA 


AAG 


TTC 


TCA 


CCA 


GTT 


GAT 


TGA 


ACCACCACGA 




439 




117 


Lys 


Asn 


Lys 


Pro 


Lys 


Phe 


Ser 


Pro 


Val 


Asp 
















118 








110 










115 


















E--> 120 C TAG TAG ATG GTTCAAATGG TGTGCTTTAC ATATAAAAAT AAAGTGTTTC 489 
E--> 122 TGATGTAAAA AAAAAAAAAA AAAAAAAAAA AACTCGAGAG TATTCTAGAG 539 
E--> 124 CGGCCGCGGG CCCATCGTTT TCCACCC 566 
C--> 127 (2) INFORMATION FOR SEQ ID NO: 3 

128 (i) SEQUENCE CHARACTERISTICS: 

129 (A) LENGTH: 134 Amino Acids 

130 (B) TYPE: Amino Acid 

131 (C) STRANDEDNESS : single 

132 (D) TOPOLOGY: linear 

133 (ii) MOLECULE TYPE: Protein 

134 (Hi) HYPOTHETICAL: no 

135 (iv) ANTI-SENSE: no 
13 6 (vi) ORIGINAL SOURCE: 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/876 , 796 



DATE: 06/27/2001 
TIME: 15:33:37 



Input Set : A:\SEQLIST.txt 

Output Set: N:\CRF3\06272001\I876796.raw 



C- 
C- 



137 (A) ORGANISM: Tenebrio molitor 

-> 13 8 (C) INDIVIDUAL ISOLATE: none 

-> 13 9 (G) CELL TYPE: fat body and whole 

140 (vii) IMMEDIATE SOURCE : 

141 (A) LIBRARY: CDNA 

142 (B) CLONE: 13.17 

143 fcfac ) FEATURE S^ 

14 5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 

147 Met Lys Leu Leu Cys Cys Leu lie Ser Leu lie 

148 -lS ' ' 1 * ' -10 v " 

150 Gin Ala Leu Tijr G^u Ala GLp lie GJu Ly.s Leu 

-> 151 * ^ -5- & 

153 Lys Cys Gin Asn Glu Ser Gly Val Ser Gin Glu 

154C15 20 25 



organism 



Leu Leu Val 

' - 5 
As^n 'Lys lie 

He He Thr 



Thr Val 



Ser Ly3 



156 Arg Asn Gl^Asp Trp Glu Asp Asp Pro Lys Leu Lys Arg Gin 



_i5_ 



40 



157 ______„___„^__ 

159 Cys Val Ala Arg Asn Ala Gly Leu Ala Thr Glu 

160 (^50 55 

162 Val Asp Val leu Arg Glu Lys Val Arg Lys Val 

163 Q,65 70 _ 

165 Glu Thr Glu Lys lie He Asn LyS Cys Ala Val" 

166 (§0 85 

168 Giu Glu Thr Val Phe Asn Thr Phe Lys Cys Jfc*r 

169 95^^ 100^ . 0-05 



Ser Gly Glu 
60 

Thr Asp Asn 




75 




E--> 171 Lys Phe Ser 
E--> 172 



irs 




Lys Arg Asp 
90 

Met Ly s— Ars-ft- 



Thr VaT^ 




C--> 175 (2) INFORMATION FOR SEQ ID NO: 

17 6 (i) SEQUENCE CHARACTERISTICS: 

177 (A) LENGTH: 116 Amino Acids 

178 (B) TYPE: Amino Acid 

179 (C) STRANDEDNESS: single 

180 (D) TOPOLOGY: linear 

181 (ii) MOLECULE TYPE: Protein 

182 (iii) HYPOTHETICAL: no 

183 (iv) ANTI-SENSE: no 

184 (vi) ORIGINAL SOURCE: 

185 (A) ORGANISM: Tenebrio molitor 
C--> 186 (C) INDIVIDUAL ISOLATE: none 

C-->187 (G ) CELL TYPE: fat body and whole organism 

188 (vii) IMMEDIATE SOURCE: 

189 (A) LIBRARY: CDNA 

190 (B) CLONE: 13.17 

191 * ( i Y ) FFflTTTRF-jS 

193 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

195 Leu Thr Glu Ala Gin He Glu Lys Leu Asn Lys He Ser Lys 

196 15 10 

198 Gin Asn Glu Ser Gly Val Ser Gin Glu He He Thr Lys Ala 

199 20 25 30 
201 Gly Asp Trp Glu Asp Asp Pro Lys Leu Lys Arg Gin Val Phe 




Lys Cys 
15 

Arg Asn 
Cys Val 
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RAW SEQUENCE LISTING DATE: 06/27/2001 

PATENT APPLICATION: US/09/876,796 TIME: 15:33:37 

Input Set : A:\SEQLIST.txt 

Output Set: N:\CRF3\06272001\I876796.raw 

202 35 40 45 

204 Ala Arg Asn Ala Gly Leu Ala Thr Glu Ser Gly Glu Val Val Val Asp 

205 50 55 60 

207 Val Leu Arg Glu Lys Val Arg Lys Val Thr Asp Asn Asp Glu Glu Thr 

208 65 70 75 80 

210 Glu Lys lie lie Asn Lys Cys Ala Val Lys Arg Asp Thr Val Glu Glu 

211 85 90 95 

213 Thr Val Phe Asn Thr Phe Lys Cys Val Met Lys Asn Lys Pro Lys Phe 

214 100 r-J 105 110 
E--> 216 Ser Pro Val Asp f*J 
E--> 217 115 ^ 

C--> 344 (2) INFORMATION FOR SEQ ID NO: 7 

34 5 (i) SEQUENCE CHARACTERISTICS: 

346 (A) LENGTH: 133 Amino Acids 

347 (B) TYPE: Amino Acid 
348. (C) STRANDEDNESS: single 

349 (D) TOPOLOGY: linear 

350 (ii) MOLECULE TYPE: Protein 

351 (iii) HYPOTHETICAL: no 

352 (iv) ANTI-SENSE: no 

353 (vi) ORIGINAL SOURCE: 

354 (A) ORGANISM: Tenebrio molitor 
C--> 355 (C) INDIVIDUAL ISOLATE: none 
C--> 356 (G) CELL TYPE: fat body and whole organism 

357 (vii) IMMEDIATE SOURCE: 

358 (A) LIBRARY: cDNA 

359 (B) CLONE: 2.2, 2.3, and 7.5 
3 60 - ( i tt ) F FA TTTP F S _ 
362 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7: 



E--> 
E--> 
C--> 



364 


Met 


Lys 


Leu 


Leu 


Leu 


Cys 


Phe 


Ala 


Phe 


Ala 


Ala 


He 


Val 


He 


Gly Ala 


365 








-15 










-10 










-5 






367 


Gin 


Ala 


Leu 


Thr 


Asp 


Glu 


Gin 


He 


Gin 


Lys 


Arg 


Asn 


Lys 


He 


Ser 


Lys 


368 






1 








5 










10 










370 


Glu 


Cys 


Gin 


Gin 


Val 


Ser 


Gly 


Val 


Ser 


Gin 


Glu 


Thr 


He 


Asp 


Lys 


Val 


371 


15 










20 










25 










30 


373 


Arg 


Thr 


Gly 


Val 


Leu 


Val 


Asp 


Asp 


Pro 


Lys 


Met 


Lys 


Lys 


His 


Val 


Leu 


374 










35 










40 










45 




376 


Cys 


Phe 


Ser 


Lys 


Lys 


Thr 


Gly 


Val 


Ala 


Thr 


Glu 


Ala 


Gly 


Asp 


Thr 


Asn 


377 








50 










55 










60 






379 


Val 


Glu 


Val 


Leu 


Lys 


Ala 


Lys 


Leu 


Lys 


His 


Val 


Ala 


Ser 


Asp 


Glu 


Glu 


380 






65 










70 










75 








382 


Val 


Asp 


Lys 


He 


Val 


Gin 


Lys 


Cys 


Val 


Val 


Lys 


Lys 


Ala 


Thr 


Pro 


Glu 


383 




80 










85 










90 










385 


Glu 


Thr 


Ala 


Tyr 


Asp 


Thr 


Phe 


Lys 


Cys 


He 


Tyr 


Asp 


Ser 


Lys 


Pro 


Asp 


386 


95 




















105 










110 


388 


Phe 


Ser 


Pro 


lie 


Asp 


r 






















389 










115 
























392 


(2) 


INFORMATION 


FOR 


SEQ 


ID NO: 8 

















393 (i) SEQUENCE CHARACTERISTICS: 



file://C:\CRF3\Outhold\VsrI876796.htm 6/27/01 



PageS of 28 



RAW SEQUENCE LISTING DATE: 06/27/2001 

PATENT APPLICATION: US/09/876 , 796 TIME: 15:33:38 

Input Set : A:\SEQLIST.txt 

Output Set: N:\CRF3\06272001\I876796.raw 

394 . (A) LENGTH: 115 Amino Acids 

395 (B) TYPE: Amino Acid 

396 (C) STRANDEDNESS : single 

397 (D) TOPOLOGY: linear 

398 (ii) MOLECULE TYPE: Protein 

399 (iii) HYPOTHETICAL: no 

400 (iv) ANTI-SENSE: no 

401 (vi) ORIGINAL SOURCE: 

402 (A) ORGANISM: Tenebrio molitor 
C--> 4 03 (C) INDIVIDUAL ISOLATE: none 

C--> 404 (G) CELL TYPE: fat body and whole organism 

405 (vii) IMMEDIATE SOURCE: 

406 (A) LIBRARY: cDNA 

407 (B) CLONE: 2.2, 2.3, and 7.5 

408 C^H^fi^^RES— " 

410 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8: 



412 


Leu 


Thr 


Asp 


Glu 


Gin 


He 


Gin 


L YS 


Arg 


Asn 


Lys 


He 


Ser 


Lys Glu 


Cys 


413 


1 








5 










10 










15 




415 


Gin 


Gin 


Val 


Ser 


Gly 


Val 


Ser 


Gin 


Glu 


Thr 


He 


Asp 


Lys 


Val 


Arg 


Thr 


416 








20 










25 










30 






418 


Gly 


Val 


Leu 


Val 


Asp 


Asp 


Pro 


Lys 


Met 


Lys 


Lys 


His 


Val 


Leu 


Cys 


Phe 


419 






35 










40 










45 








421 


Ser 


Lys 


Lys 


Thr 


Gly 


Val 


Ala 


Thr 


Glu 


Ala 


Gly 


Asp 


Thr 


Asn 


Val 


Glu 


422 




50 










55 










60 










424 


Val 


Leu 


Lys 


Ala 


Lys 


Leu 


Lys 


His 


Val 


Ala 


Ser 


Asp 


Glu 


Glu 


Val 


Asp 


425 


65 










70 










75 










80 


427 


Lys 


He 


Val 


Gin 


Lys 


Cys 


Val 


Val 


Lys 


Lys 


Ala 


Thr 


Pro 


Glu 


Glu 


Thr 


428 










85 










90 










95 




430 


Ala 


Tyr 


Asp 


Thr 


Phe 


Lys 


Cys 


He 


Tyr 


Asp 


Ser 


Lys 


Pro 


Asp 


Phe 


Ser 


431 








100 










105 










110 







E--> 4 33 Pro He Asp M 

E--> 434 115 

C--> 500 (2) INFORMATION FOR SEQ ID NO: 10 

501 (i) SEQUENCE CHARACTERISTICS: 

502 (A) LENGTH: 133 Amino Acids 

503 (B) TYPE: Amino Acid 

504 (C) STRANDEDNESS: single 

505 (D) TOPOLOGY: linear 

506 (ii) MOLECULE TYPE: Protein 

507 (iii) HYPOTHETICAL: no 

508 (iv) ANTI-SENSE: no 
50 9 (vi) ORIGINAL SOURCE: 

510 (A) ORGANISM: Tenebrio molitor 

C--> 511 (C) INDIVIDUAL ISOLATE: none 

C--> 512 (G) CELL TYPE: fat body and whole organism 

513 (vii) IMMEDIATE SOURCE: 

514 (A) LIBRARY: cDNA 

515 (B) CLONE: 3.4 

516 jjjt^-FfiftTTraEs - ^ 
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RAW SEQUENCE LISTING DATE: 06/27/2001 

PATENT APPLICATION: US/09/876,796 TIME: 15:33:38 

Input Set : A:\SEQLIST.trt 

Output Set: N:\CRF3\06272001\I876796.raw 



518 (xi) 





l Y it: u 


Lys 


Leu 


Leu 


Leu 




tr lit; 


nla 


file 


H.J.CI 


Ala 
Aid 


He 


Val 


He 




£\A.ci 


521 








-15 










-10 










-5 






523 


Gin 


Ala 


Leu 


Thr 


Asp 


Glu 


Gin 


He 


Gin 


Lys 


Arg 


Asn 


Lys 


He 


Ser 


Lys 


524 






1 








5 










10 










526 


Glu 


Cys 


Gin 


Gin 


Val 


Ser 


Gly 


Val 


Ser 


Gin 


Glu 


Thr 


He 


Asp 


Lys 


Val 


527 


15 










20 










25. 










30 


529 


Arg 


Thr 


Gly 


Val 


Leu 


Val 


Asp 


Asp 


Pro 


Lys 


Met 


Lys 


Lys 


His 


Val 


Leu 


530 










35 










40 










45 




532 


Cys 


Phe 


Ser 


Lys 


Lys 


Thr 


Gly 


Val 


Ala 


Thr 


Glu 


Ala Gly Asp 


Thr 


Asn 


533 








50 










55 










60 






535 


Val 


Glu 


Val 


Leu 


Lys 


Ala 


Lys 


Leu 


Lys 


His 


Val 


Ala 


Ser 


Asp 


Glu 


Glu 


536 






65 










70 










75 








538 


Val 


Asp 


Lys 


He 


Val 


Gin 


Lys 


Cys 


Val 


Val 


Lys 


Lys 


Ala 


Thr 


Pro 


Glu 


539 




80 










85 










90 










541 


Glu 


Thr 


Ala 


Tyr 


Asp 


Thr 


Phe 


Lys 


Val 


He 


Tyr 


Asp 


Ser 


Lys 


Pro 


Asp 



542 95 IQ0 y 105 110 

E--> 544 Phe Ser Pro He Asp 

E--> 545 115 

C--> 54 8 (2) INFORMATION FOR SEQ ID NO: 11 

54 9 (i) SEQUENCE CHARACTERISTICS: 

550 (A) LENGTH: 115 Amino Acids 

551 (B) TYPE: Amino Acid 

552 (C) STRANDEDNESS : single 

553 (D) TOPOLOGY: linear 

554 (ii) MOLECULE TYPE: Protein 

555 (Hi) HYPOTHETICAL: no 

556 (iv) ANTI-SENSE: no 

557 (vi) ORIGINAL SOURCE: 

558 (A) ORGANISM: Tenebrio molitor 
C-->559 (C) INDIVIDUAL ISOLATE: none 
C-->560 (G) CELL TYPE: fat body and whole organism 

561 (vii) IMMEDIATE SOURCE: 

562 (A) LIBRARY: cDNA 

563 (B) CLONE: 3.4 

564 - ■{ W)TTflft TT T P flfi 

566 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 




568 


Leu 


Thr 


Asp 


Glu 


Gin 


He 


Gin 


Lys 


Arg 


Asn 


Lys 


He 


Ser 


Lys 


Glu 


Cys 


569 


1 








5 










10 










15 




571 


Gin 


Gin 


Val 


Ser 


Gly 


Val 


Ser 


Gin 


Glu 


Thr 


lie 


Asp 


Lys 


Val 


Arg 


Thr 


572 








20 










25 










30 






574 


Gly Val 


Leu 


Val 


Asp 


Asp 


Pro 


Lys 


Met 


Lys 


Lys 


His 


Val 


Leu 


Cys 


Phe 


575 






35 










40 










45 








577 


Ser 


Lys 


Lys 


Thr Gly 


Val. Ala 


Thr 


Glu 


Ala 


Gly Asp 


Thr 


Asn 


Val 


Glu 


578 




50 










55 










60 










580 


Val 


Leu 


Lys 


Ala 


Lys 


Leu 


Lys 


His 


Val 


Ala 


Ser 


Asp 


Glu 


Glu 


Val 


Asp 


581 


65 










70 










75 










80 


583 


Lys 


He 


Val 


Gin 


Lys 


Cys 


Val 


Val 


Lys 


Lys 


Ala 


Thr 


Pro 


Glu 


Glu 


Thr 


584 










85 










90 










95 
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RAW SEQUENCE LISTING DATE: 06/27/2001 

PATENT APPLICATION: US/09/876, 796 TIME: 15:33:38 

input Set : A:\SEQLIST.txt 

Output Set: N:\CRF3\06272001\I876796.raw 



x I1J. 



586 Ala Tyr Asp Thr Phe Lys Val lie Tyr Asp Ser Lys Pro Asp Phe Ser 

587 AJ/O 105 110 
E--> 589 Pro lie Asp 
E--> 590 115 
C--> 656 (2) INFORMATION FOR SEQ ID NO: 13 

657 (i) SEQUENCE CHARACTERISTICS: 

658 (A) LENGTH: 133 Amino Acids 

659 (B) TYPE: Amino Acid 

660 (C) STRANDEDNESS: single 

661 (D) TOPOLOGY: linear 

662 (ii) MOLECULE TYPE: Protein 

663 (iii) HYPOTHETICAL: no 

664 (iv) ANTI-SENSE: no 

665 (vi) ORIGINAL SOURCE: 

666 (A) ORGANISM: Tenebrio molitor 
C--> 667 (C) INDIVIDUAL ISOLATE: none 

C-->668 (G) CELL TYPE: fat body and whole organism 

669 (vii) IMMEDIATE SOURCE: 

670 (A) LIBRARY: cDNA 

671 (B) CLONE: 3.9 

672 ^d^^EZmiKES 

674 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

676 Met Lys Leu Leu Leu Cys Phe Ala Phe Ala Ala lie Val lie Gly Ala 

677 -15 -10 -5 

679 Gin Ala Leu Thr Asp Glu Gin lie Gin Lys Arg Asn Lys lie Ser Lys 

680 1 5 10 

682 Glu Cys Gin Gin Glu Ser Gly Val Ser Gin Glu Thr lie Asp Lys Val 

683 15 20 25 30 

685 Arg Thr Gly Val Leu Val Asp Asp Pro Lys Met Lys Lys His Val Leu 

686 35 40 45 

688 Cys Phe Ser Lys Arg Thr Gly Val Ala Thr Glu Ala Gly Asp Thr Asn 

689 50 55 /faserf Space 60 

E--> 691 Val Glu Val Leu Lys Ala Lys Leu Lys Hi£val Ala Ser Asp Glu Glu 
E--> 692 65 70 75 

694 Val Asp Lys He Val Gin Lys Cys Val Val Lys Lys Ala Thr Pro Glu 
E--> 695 80 85 90 

697 Glu Thr Ala Tyr Asp Thr Phe Lys Cys He Tyr Asp Ser Lys Pro Asp 

E--> 698 95 100 J 105 110 

E--> 700 Phe Ser Pro He Asp 
E--> 701 

C--> 704 (2) INFORMATION FOR SEQ ID NO: 14 

705 (i) SEQUENCE CHARACTERISTICS: 

706 (A) LENGTH: 115 Amino Acids 

707 (B) TYPE: Amino Acid 

708 (C) STRANDEDNESS: single 

709 (D) TOPOLOGY: linear 

710 (ii) MOLECULE TYPE: Protein 

711 (iii) HYPOTHETICAL: no 

712 (iv) ANTI -SENSE: no 



Cfc 

100 J 
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713 (vi) ORIGINAL SOURCE: 

714 (A) ORGANISM: Tenebrio molitor 
C--> 715 (C) INDIVIDUAL ISOLATE: none 

C-->716 (G) CELL TYPE: fat body and whole organism 

717 (vii) IMMEDIATE SOURCE: 

718 (A) LIBRARY: cDNA 

719 (B) CLONE: 3.9 

720 fa^?TE7mmfiS-^ 

722 (xi) SEQUENCE DESCRIPTION : SEQ ID NO: 14: 

724 

725 

727 

728 

730 

731 

733 

734 
E--> 736 
E--> 737 

739 
E--> 740 

742 
E--> 743 
E--> 745 
E--> 746 

C--> 74 9 (2) INFORMATION FOR SEQ ID NO: 15 

750 (i) SEQUENCE CHARACTERISTICS: 

751 (A) LENGTH: 481 base pairs 

752 (B) TYPE: nucleic acid 

753 (C) STRANDEDNESS: double 

754 (D) TOPOLOGY: linear 

75 5 (ii) MOLECULE TYPE: cDNA to mRNA 

756 (iii) HYPOTHETICAL: no 

757 (iv) ANTI-SENSE: no 

758 (vi) ORIGINAL SOURCE: 

759 (A) ORGANISM: Tenebrio molitor 
C--> 760 (C) INDIVIDUAL ISOLATE: none 
C--> 761 (G) CELL TYPE: fat body and whole organism 

762 (vii) IMMEDIATE SOURCE: 

76 3 (A) LIBRARY: cDNA 

764 (B) CLONE: 7.5 

765 ' (-iil ) TDATUftE J " 
767 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

769 GGCACGAGCA AAA ATG AAA CTC CTC TTG TGC TTT GCG TTC GCC GCC 4 6 

770 Met Lys Leu Leu Leu Cys Phe Ala Phe Ala Ala 

771 -15 -10 
E--> 773 ATC GTC ATC GGA GCT CAG GCT CTC ACC GAC GAA CAG ATA CAG AAA (90 

774 He Val He Gly Ala Gin Ala Leu <Thr Asp Glu Gin He Gin Lys 

775 -5 15 



Leu 


Thr 


Asp 


Glu 


Gin 


lie 


Gin 


Lys Arg Asn 


Lys He 


Ser 


Lys 


Glu 


Cys 


1 








5 






10 








15 




Gin 


Gin 


Glu 


Ser 
20 


Gly 


Val 


Ser 


Gin Glu Thr 
25 


He Asp 


Lys 


Val 
30 


Arg 


Thr 


Gly Val 


Leu 


Val 


Asp 


Asp 


Pro 


Lys Met Lys 


Lys His 


Val 


Leu 


Cys 


Phe 






35 










40 




45 








Ser 


Lys 
50 


Arg 


Thr 


Gly 


Val 


Ala 
55 


Thr Glu Ala 


Gly Asp 
60 


Thr 


Asn 


Val 


Glu 


Val 


Leu 


Lys 


Ala 


Lys 


Leu 


Lys 


HisVal Ala Ser Asp Glu Glu Val Asp 


65 










70 






75 






80 


Lys 


He 


Val 


Gin 


Lys 
85 


Cys 


Val 


Val Lys Lys 
90 


Ala Thr 


Pro 


Glu 


Glu 
95 


Thr 


Ala 


Tyr 


Asp 

] 


Thr 


Phe 


Lys 


Cys 


He Tyr Asp 
105 


Ser Lys 


Pro 


Asp 
110 


Phe 


Ser 


Pro 


He 


Asp 
115 
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777 AGG AAC AAG ATC AGC AAA GAG TGC CAG CAG GTG TCC GGA GTG TCC 136 

778 Arg Asn Lys lie Ser Lys Glu Cys Gin Gin Val Ser Gly Val Ser 

779 10 15 20 

781 CAA GAG ACG ATC GAC AAA GTC CGC ACA GGT GTC TTG GTC GAC GAT 181 

782 Gin Glu Thr He Asp Lys Val Arg Thr Gly Val Leu Val Asp Asp 

783 25 30 35 

785 CCC AAA ATG AAG AAG CAC GTC CTC TGC TTC TCG AAG AAA ACT GGA 226 

786 Pro Lys Met Lys Lys His Val Leu Cys Phe Ser Lys Lys Thr Gly 

787 40 45 50 

789 GTG GCA ACC GAA GCC GGA GAC ACC A AT GTG GAG GTA CTC AAA GCC 271 

790 Val Ala Thr Glu Ala Gly Asp Thr Asn Val Glu Val Leu Lys Ala 

791 55 60 65 

79 3 AAG CTG AAG CAT GTG GCC AGC GAC GAA GAG GTG GAC AAG ATC GTG 316 

794 Lys Leu Lys His Val Ala Ser Asp Glu Glu Val Asp Lys He Val 

795 70 75 80 

7 97 CAG AAG TGC GTG GTC AAG AAG GCC ACA CCA GAG GAA ACG GCT TAT 3 61 

798 Gin Lys Cys Val Val Lys Lys Ala Thr Pro Glu Glu Thr Ala Tyr 

799 85 90 95 

801 GAC ACC TTC AAG TGT ATT TAC GAC AGT AAA CCT GAT TTC TCT CCT 406 

802 Asp Thr Phe Lys Cys He Tyr Asp Ser Lys Pro Asp Phe Ser Pro 

803 100 105 110 

80 5 ATT GAT TAA TTGTTTTGTA TTTGGCTGAA TTTTGACAAT AAAGGTACTA 455 

806 He Asp * 

807 115 

809 TCGTTATGTA AAAAAAAAAA AAAAAA 4 81 

C--> 88 8 (2) INFORMATION FOR SEQ ID NO: 17 

88 9 (i) SEQUENCE CEARAC^RISXICS^ -4 /O? 

890 (A) LENGTH: (l6p Amino Acids ( ' ^ 

891 (B) TYPE: Amino Acid 

892 (C) STRANDEDNESS: single 

893 (D) TOPOLOGY: linear 

894 (ii) MOLECULE TYPE: Protein 

895 (iii) HYPOTHETICAL: no 

896 (iv) ANTI-SENSE: no 

897 (vi) ORIGINAL SOURCE: 

898 (A) ORGANISM: Tenebrio molitor 
C--> 899 (C) INDIVIDUAL ISOLATE: none 
C-->900 (G) CELL TYPE: fat body and whole organism 

901 (vii) IMMEDIATE SOURCE: 

902 (A) LIBRARY: cDNA 

903 (B) CLONE: 2.2 

904 (J j) FEATURES * 
906 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

908 Met Gly Ser Ser His His His His His His Ser Ser Gly Leu Val Pro 

909 - " -55 -50 -45 

911 Arg Gly Ser His Met Ala Ser Met Thr Gly Gly Gin Gin Met Gly Arg 

912 -40 -35 -30 

914 Gly Ser Glu Phe Ala Arg Ala Lys Met Lys Leu Leu Leu Cys Phe Ala 

915 -25 -20 -15 



f.fr 
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Q1 7 


Php 


Ala 


Ala 


T 1 ^ 
lie 


VQ1 


Tip 

lie 


uiy 


Al ^ 

ftlQ 


C 1 n 

Vjlil 


Ala 


Leu 


Thr 


A can 


UlU 


r\ 1 n 


Tip 

lie 


Q1 ft 


1 \J 










- s 










1 








J 




920 


Gin 


Lys 


Arg 


As n 


Lys 


He 


Ser 


Lys 


Glu 


Cys 


Gin 


Gin 


Val 


Ser 


Gly 


Val 


y zx 








IU 










ID 










zU 






923 


Ser 


Gin 


Glu 


Thr 


He 


Asp 


Lys 


Val 


Arg 


Thr Gly Val 


Leu 


Val 


Asp 


Asp 


924 






25 










30 










35 








926 


Pro 


Lys 


Met 


Lys 


Lys 


His 


Val 


Leu 


Cys 


Phe 


Ser 


Lys 


Lys 


Thr 


Gly 


Val 


927 




40 










45 










50 










929 


Ala 


Thr 


Glu 


Ala 


Gly 


Asp 


Thr 


Asn 


Val 


Glu 


Val 


Leu 


Lys 


Ala 


Lys 


Leu 


930 


55 










60 










65 










70 


932 


Lys 


His 


Val 


Ala 


Ser 


Asp 


Glu 


Glu 


Val 


Asp 


Lys 


He 


Val 


Gin 


Lys 


Cys 


933 










75 










80 










85 




935 


Val 


Val 


Lys 


Lys 


Ala 


Thr 


Pro 


Glu 


Glu 


Thr 


Ala 


Tyr 


Asp 


Thr 


Phe 


Lys 


936 








90 










95 










100 






938 


Cys 


He 


Tyr 


Asp 


Ser 


Lys 


Pro 


Asp 


Phe 


Ser 


Pro 


lie 


Asp 




a* 


V 


939 






105 










110 










lis 









E--> 
E--> 

C--> 1009 (2) INFORMATION FOR SEQ ID NO: 19 

.1010 (i) SEQUENCE CHARACTERISTICS: 

1011 (A) LENGTH: 149 Amino Acids 

1012 (B) TYPE: Amino Acid 

1013 (C) STRANDEDNESS: single 

1014 (D) TOPOLOGY: linear 

1015 (ii) MOLECULE TYPE: Protein 

1016 (iii) HYPOTHETICAL: no 

1017 (iv) ANTI-SENSE: no 

1018 (vi) ORIGINAL SOURCE: 

1019 (A) ORGANISM: Tenebrio molitor 
C--> 1020 (C) INDIVIDUAL ISOLATE: none 
C--> 1021 (G) CELL TYPE: fat body and whole organism 

1022 (vii) IMMEDIATE SOURCE: 

1023 (A) LIBRARY: cDNA 

1024 (B) CLONE: 2.2 

1025 ^i^^EOTaRES ' 

1027 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

1029 Met Gly Ser Ser His His His His His His Ser Ser Gly Leu Val Pro 

1030 -30 -25 -20 

1032 Arg Gly Ser His Met Ala Ser Met Thr Gly Gly Gin Gin Met Gly Arg 

1033 -15 -10 -5 

1035 Gly Ser Leu Thr Asp Glu Gin He Gin Lys Arg Asn Lys He Ser Lys 

1036 1 5 10 

1038 Glu Cys Gin Gin Val Ser Gly Val Ser Gin Glu Thr He Asp Lys Val 

1039 15 20 25 30 

1041 Arg Thr Gly Val Leu Val Asp Asp Pro Lys Met Lys Lys His Val Leu 

1042 35 40 45 

1044 Cys Phe Ser Lys Lys Thr Gly Val Ala Thr Glu Ala Gly Asp Thr Asn 

1045 50 55 60 

1047 Val Glu Val Leu Lys Ala Lys Leu Lys His' Val Ala Ser Asp Glu Glu 

1048 65 70 75 

1050 Val Asp Lys lie Val Gin Lys Cys Val Val Lys Lys Ala Thr Pro Glu 
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1051 






1053 






1054 


E- 


-> 


1056 


E- 


- > 


1057 


C- 


-> 


1136 






1137 






1138 






1139 






1140 






1141 






1142 






1143 






1144 






1145 






1146 


C- 


-> 


1147 


C- 


-> 


1148 






1149 






1150 






1151 






1152 






1154 






1156 






1157 






1159 






1160 






1162 






1163 






1165 






1166 






1168 






1169 






1171 






1172 






1174 






1175 






1177 






1178 






1180 






1181 






1183 






1184 


E- 


-> 


1186 


E- 


-> 


1187 


c- 


-> 


1257 






1258 






1259 






1260 
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80 85 90 

Glu Thr Ala Tyr Asp Thr Phe Lys Cys lie Tyr Asp Ser Lys Pro Asp 

95 IQJt 105 110 




115 

(2) INFORMATION FOR SEQ 

(i) SEQUENCE CHAF 

(A) LENGTH: (169 atmino 

(B) TYPE: An^irftcrAcid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: Protein 

(iii) HYPOTHETICAL: no 
(iv) ANTI-SENSE: no 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Tenebrio molitor 
(C) INDIVIDUAL ISOLATE: none 
(G) CELL TYPE: fat body and whole organism 
(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: cDNA 

(B) CLONE: 2.3 
( «L-ii ) rDATURDO > 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
Met Gly Ser 

Arg Gly Ser 
-40 

Gly Ser Glu 
-25 

Phe Ala Ala 
-10 

Gin Lys Arg 

Ser Gin Glu 
25 

Pro Lys Met 

40 45 50 

1177 Ala Thr Glu Ala Gly Asp Thr Asri Val Glu Val Leu Lys Ala Lys Lea 
55 60 65 70 

Lys His Val Ala Ser Asp Glu Glu Val Asp Lys lie Val Gin Lys Cys 

75 80 85 f 

Val Val Lys Lys Ala Thr Pro Glu Glu Thr Ala Tyr Asp Thr Phe Lys 

90 95 
Cys lie Tyr Asp Ser Lys Pro Asp Phe Ser Pro I^e Asp 
* 105 no ' v lis 

(2) INFORMATION FOR SEQ ID NO: 23 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 9 Amino Acids 

(B) TYPE: Amino Acid 



Ser 


His 


His 


His 


His 


His 


His 


Ser 


Ser 


Gly 


Leu 


Val Pro 


-55 










-50 










-45 




His 


Met 


Ala 


Ser 


Met 


Thr 


Gly 


Gly 


Gin 


Gin 


Met 


Gly Arg 










-35 










-30 






Phe 


Ala 


Arg 


Ala 


Lys 


Met 


Lys 


Leu 


Leu 


Leu 


Cys 


Phe Ala' 








-20 










-15 








He 


Val 


He 


Gly 


Ala 


Gin 


Ala 


Leu 


Thr 


Asp 


Glu 


Gin He. 






-5 










1 








5 


Asn 


Lys 


He 


Ser 


Lys 


Glu 


Cys 


Gin 


Gin 


Val 


Ser 


Gly Val 


10 










15 










20 




Thr 


He 


Asp 


Lys 


Val 


Arg 


Thr 


Gly Val 


Leu 


Val 


Asp As^p 










30 










35 




Lys 


Lys 


His 


Val 


Leu 


Cys 


Phe 


Ser 


Lys 


Lys 


Thr 


Gly Val 



nr 
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1261 (C) STRANDEDNESS : single 

1262 (D) TOPOLOGY: linear 

1263 (ii) MOLECULE TYPE: Protein 

1264 (iii) HYPOTHETICAL: no 

1265 (iv) ANTI-SENSE: no 

1266 (vi) ORIGINAL SOURCE: 

1267 (A) ORGANISM: Tenebrio molitor 
C--> 1268 (C) INDIVIDUAL ISOLATE: none 

C--> 1269 (G) CELL TYPE: fat body and whole organism 

1270 (vii) IMMEDIATE SOURCE: 

1271 (A) LIBRARY: CDNA 

1272 (B) CLONE: 2.3 

1273 ( iA)l'lLAl'UUl!ij — ' 

1275. (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

1277 Met Gly Ser Ser His His His His His His Ser Ser Gly Leu Val Pro 

1278 -30 -25 -20 

1280 Arg Gly Ser His Met Ala Ser Met Thr Gly Gly Gin Gin Met Gly Arg 

1281 -15 -10 -5 

1283 Gly Ser Leu Thr Asp Glu Gin lie Gin Lys Arg Asn Lys lie Ser Lys 

1284 1 5 10 

1286 Glu Cys Gin Gin Val Ser Gly Val Ser Gin Glu Thr lie Asp Lys Val 

1287 15 20 25 30 

1289 Arg Thr Gly Val Leu Val Asp Asp Pro Lys Met Lys Lys His Val Leu 

1290 35 40 45 

1292 Cys Phe Ser Lys Lys Thr Gly Val Ala Thr Glu Ala Gly Asp Thr Asn 

1293 50 55 " 60 

1295 Val Glu Val Leu Lys Ala Lys Leu Lys His Val Ala Ser Asp Glu Glu 

1296 65 70 75 

1298 Val Asp Lys lie Val Gin Lys Cys Val Val Lys Lys Ala Thr Pro Glu 

1299 80 85 90 

1301 Glu Thr Ala Tyr Asp Thr Phe Lys Cys He Tyr Asp Ser Lys Pro Asp 

1302 95 10Q 105 110 
E--> 1304 Phe Ser Pro He 
E--> 1305 

C--> 1308 (2) INFORMATION FOR SEQ ID NO: 

1309 (i) SEQUENCE CHAR^jQTERI^ 

1310 (A) LENGTH: (77fjbase pairs' 

1311 (B) TYPE: nucTeic acid 

1312 (C) STRANDEDNESS: double 

1313 (D) TOPOLOGY: linear 

1314 (ii) MOLECULE TYPE: CDNA to mRNA 

1315 (iii) HYPOTHETICAL: no 

1316 (iv) ANTI-SENSE: no 

1317 (vi) ORIGINAL SOURCE: 

1318 (A) ORGANISM: Tenebrio molitor 
C--> 1319 (C) INDIVIDUAL ISOLATE: none 
C--> 1320 ( G ) CELL TYPE: fat body and whole organism 

1321 (vii) IMMEDIATE SOURCE: 

1322 (A) LIBRARY: cDNA 



nojj j. in 

iop 

Asp (* f 
115 W 



|3> 
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1323 (B) CLONE: 13.17 

1324 frix) FEATURE S 

1327 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

1329 TTGTTAGCGG ATGGAATTCC CTCGTAGGGG ATAATTTTGT TTACTTTAAG 50 

13 31 AAGGAGATAT ACC ATG GGC AGC AGC CAT CAT CAT CAT CAT CAC AGC 96 

1332 Met Gly Ser Ser His His His His His His Ser 

1333 -65 -60 -55 

1335 AGC GGC CTG GTG CCG CGC GGC AGC CAT ATG GCT AGC ATG ACT GGT 141 

1336 Ser Gly Leu Val Pro Arg Gly Ser His Met Ala Ser Met Thr Gly 

1337 -50 -45 -40 

13 39 GGA CAG CAA ATG GGT CGC GGA TCC GAA TTC TGG ATC CAA AGA ATT 186 

1340 Gly Gin Gin Met Gly Arg Gly Ser Glu Phe Trp lie Gin Arg He 

1341 -35 -30 -25 

134 3 CGG CAC GAG ACT ACT AAG ATG AAG TTG CTC TGT TGT CTA ATC TCC 2 31 

1344 Arg His Glu Thr Thr Lys Met Lys Leu Leu Cys Cys Leu He Ser 

1345 -20 -15 -10 

134 8 CTC ATT CTG TTG GTC ACA GTT CAG GCC CTG ACC GAG GCA CAA ATT 276 

1349 Leu He Leu Leu Val Thr Val Gin Ala Leu Thr Glu Ala Gin He 

1350 -5 15 

1352 GAG AAA CTG AAC AAG ATC AGC AAA AAA TGT CAA AAT GAA AGT GGA 321 
1353 
1354 

13 56 GTG TCG CAA GAG ATC ATA ACC AAA GCT CGC AAC GGT GAC TGG GAG 366 
1357 
1358 

13 60 GAC GAT CCT AAA CTG AAA CGC CAA GTT TTT TGC GTG GCC AGG AAC 411 
1361 
1362 

1364 GCC GGT CTG GCC ACG GAA TCG GGA GAG GTG GTG GTC GAC GTG TTG 4 56 
1365 
1366 

13 68 AGG GAG AAG GTG AGG AAG GTC ACT GAC AAC GAC GAA GAA ACT GAG 501 
1369 
1370 

1372 AAA ATC ATC AAT AAG TGC GCC GTC AAG AGA GAT ACT GTT GAA GAG 54 6 



1374 



GAG 


AAA 


CTG 


AAC 


AAG 


ATC 


AGC 


AAA 


AAA 


TGT 


CAA 


AAT 


GAA 


AGT 


GGA 


Glu 


Lys 


Leu 


Asn 
10 


Lys 


He 


Ser 


Lys 


Lys 
15 


Cys 


Gin 


Asn 


Glu 


Ser 
20 


Gly 


GTG 


TCG 


CAA 


GAG 


ATC 


ATA 


ACC 


AAA 


GCT 


CGC 


AAC 


GGT 


GAC 


TGG 


GAG 


Val 


Ser 


Gin 


Glu 
25 


He 


He 


Thr 


Lys 


Ala 
30 


Arg 


Asn 


Gly 


Asp 


Trp 
35 


Glu 


GAC 


GAT 


CCT 


AAA 


CTG 


AAA 


CGC 


CAA 


GTT 


TTT 


TGC 


GTG 


GCC 


AGG 


AAC 


Asp 


Asp 


Pro 


Lys 
40 


Leu 


Lys 


Arg 


Gin 


Val 
45 


Phe 


Cys 


Val 


Ala 


Arg 
50 


Asn 


GCC 


GGT 


CTG 


GCC 


ACG 


GAA 


TCG 


GGA 


GAG 


GTG 


GTG 


GTC 


GAC 


GTG 


TTG 


Ala 


Gly 


Leu 


Ala 
55 


Thr 


Glu 


Ser 


Gly 


Glu 
60 


Val 


Val 


Val 


Asp 


Val 
65 


Leu 


AGG 


GAG 


AAG 


GTG 


AGG 


AAG 


GTC 


ACT 


GAC 


AAC 


GAC 


GAA 


GAA 


ACT 


GAG 


Arg 


Glu 


Lys 


Val 
70 


Arg 


Lys 


Val 


Thr 


Asp 
75 


Asn 


Asp 


Glu 


Glu 


Thr 
80 


Glu 


AAA 


ATC 


ATC 


AAT 


AAG 


TGC 


GCC 


GTC 


AAG 


AGA 


GAT 


ACT 


GTT 


GAA 


GAG 


Lys 


He 


He 


Asn 
85 


Lys 


Cys 


Ala 


Val 


Lys 
90 


Arg 


Asp 


Thr 


Val 


Glu 
95 


Glu 


AQG 


G^G 


TT£ 


AAT 


ACT 


TTC 


AAA 


TGT 


GTC 


ATG 


AAA 


AAC 


AAG 


CCA 


AAG 


Thr 


Val 


Phe 


Asn 
100 


Thr 


Phe 


Lys 


Cys 


Val 
105 


Met 


Lys 


Asn 


Lys 


Pro 
110 


Lys 


TTC 


TCA 


CCA 


GTT 


GAT 


TGA 


ACCACCACGA CT AG TAG ATG GTTCAAATGG 


Phe 


Ser 


Pro 


Val 
115 


Asp 


* 





















1377 
1378 

E*-> 1380 TTC TCA CCA GTT GAT TGA ACCACCACGA CT AG TAG ATG GTTCAAATGG 64 3 

1381 
1382 

E--> 1384 TGTGCTTTAC ATATAAAAAT AAAGTGTTTC TGATGTAAAA AAAAAAAAAA 693 
E--> 1386 AAAAAAAAAA AACTCGAGAG TATTCTAGAG CGGCCGCGGG CCCATCGTTT 74 3 

E--> 1388 TCCACCCCTC GAGCACCACC ACCACCACCA CTGAGAT 777 
C--> 1391 (2) INFORMATION FOR SEQ ID NO: 25 

1392 (i) SEQUENCE CHARACTERISTICS : 

1393 (A) LENGTH : /170 \mino Acids 



lot 



09$?/ 
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RAW SEQUENCE LISTING DATE: 06/27/2001 

PATENT APPLICATION: US/09/876,796 TIME: 15:33:38 

input Set : A:\SEQLIST.txt 

Output Set: N:\CRF3\06272001\I876796.raw 

1394 (B) TYPE: Amino Acid 

1395 (C) STRANDEDNESS : single 

1396 (D) TOPOLOGY: linear 

1397 (ii) MOLECULE TYPE: Protein 

1398 (iii) HYPOTHETICAL: no 

1399 (iv) ANTI-SENSE: no 

1400 (vi) ORIGINAL SOURCE: 

1401 (A) ORGANISM: Tenebrio molitor 
C--> 1402 (C) INDIVIDUAL ISOLATE: none 

C--> 14 03 (G) CELL TYPE: fat body and whole organism 

1404 (vii) IMMEDIATE SOURCE: 

1405 (A) LIBRARY: cDNA 

1406 (B) CLONE: 13.17 

1407 (4v)FKATTTPES * 

1409 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

1411 Met Gly Ser Ser His His His His His His Ser Ser Gly Leu Val Pro^ 

1412 -55 -50 -45 

1414 Arg Gly Ser His Met Ala Ser Met Thr Gly Gly Gin Gin Met Gly Arg 

1415 -40 -35 -30 

1417 Gly Ser Glu Phe Ala Arg Ala Lys Met Lys Leu Leu Cys Cys Leu lie 

1418 -25 -20 -15 -10 
1420 Ser Leu lie Leu Leu Val Thr Val Gin Ala Leu Thr Glu Ala Gin lie , 

e--> 1421 • . * -5 ^ v ' 4_ 5 Cr^ hcS^orr 

1423 Glu Lys Leu Asn Lys lie Ser Lys Lys Cys Gin Asn Glu Ser Gly Val, 

1424 10 15 20 

1426 Ser Gin Glu lie lie Thr Lys Ala Arg Asn Gly Asp Trp Glu Asp Asp 

1427 25 30 35 

1429 Pro Lys Leu Lys Arg Gin Val Phe Cys Val Ala Arg Asn Ala Gly Leu 

1430 40 45 50 

1432 Ala Thr Glu Ser Gly Glu Val Val Val Asp Val Leu Arg Glu Lys Val 
. 1433 55 60 65 70 - 

1435 Arg Lys Val Thr Asp Asn Asp Glu Glu Thr Glu Lys lie lie Asn Lys 

1436 75 80 85 

1438 Cys Ala Val Lys Arg Asp Thr Val Glu Glu Thr Val Phe Asn Thr Phe 

1439 90 95 1 °°r< /r 
E--> 1441 Lys Cys Val Met Lys Asn Lys Pro Lys Phe Ser Pro Val Aspfc) 
E--> 1442 * 105 110 115 . * ^ 
C--> 1512 (2) INFORMATION FOR SEQ ID NO: 27 * 

1513 (i) SEQUENCE CHARACTERISTICS: 

1514 (A) LENGTH: 149 Amino Acids 

1515 (B) TYPE: Amino Acid 

1516 (C) STRANDEDNESS: single 

1517 (D) TOPOLOGY: linear 

1518 (ii) MOLECULE TYPE: Protein 

1519 (iii) HYPOTHETICAL: no 

1520 (iv) ANTI-SENSE: no 

1521 (vi) ORIGINAL SOURCE: 

1522 (A) ORGANISM: Tenebrio molitor 
C--> 1523 (C) INDIVIDUAL ISOLATE: none 
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RAW SEQUENCE LISTING DATE: 06/2 7/2001 

PATENT APPLICATION: US/09/876,796 TIME: 15:33:38 

Input Set : A:\SEQLIST.txt 

Output Set: N:\CRF3\06272001\I876796.raw 

C--> 1524 (G) CELL TYPE: fat body and whole organism 

1525 (vii) IMMEDIATE SOURCE: 

1526 (A) LIBRARY : cDNA 

1527 (B) CLONE: 13.17 

1528 fc lxjrEATURCS — ■ 

1530 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 





Met 


fit; 


Ser 


Ser 


HIS 


nlS 


His 


rll S 


TT -I o 

nlS 


His 


Ser 


ber \jLy 


Leu 


Val 


Pro 


1 ~i i 
ID j o 








*3 n 
J U 










- ZD 








- 20 






1535 


Arg 


Gly 


Ser 


His 


Met 


Ala 


Ser 


Met 


Thr 


Gly Gly 


Gin Gin 


Met 


Gly 


Arg 


1536 






-15 










-10 








-5 








1538 


Gly 


Leu 


Thr 


Glu 


Ala 


Gin 


He 


Glu 


Lys 


Leu 


Asn 


Lys He 


Ser 


Lys 


Lys 


1539 




1 








5 










10 








15 


1541 


Cys 


Gin 


Asn 


Glu 


Ser 


Gly 


Val 


Ser 


Gin 


Glu 


He 


He Thr 


Lys 


Ala 


Arg 


1542 










20 










25 








30 




1544 


Asn 


Gly 


Asp 


Trp 


Glu 


Asp 


Asp 


Pro 


Lys 


Leu 


Lys 


Arg Gin 


Val 


Phe 


Cys 


1545 








35 










40 








45 






1547 


Val 


Ala 


Arg 


Asn 


Ala 


Gly 


Leu 


Ala 


Thr 


Glu 


Ser 


Gly Glu 


Val 


Val 


Val 


1548 






50 










55 








60 








1550 


Asp 


Val 


Leu 


Arg 


Glu 


Lys 


Val 


Arg 


Lys 


Val 


Thr 


Asp Asn 


Asp 


Glu 


Glu 


1551 




65 










70 










75 








1553 


Thr 


Glu 


Lys 


He 


He 


Asn 


Lys 


Cys 


Ala 


Val 


Lys 


Arg Asp 


Thr 


Val 


Glu 


1554 


80 










85 










90 








95 


1556 


Glu 


Thr 


Val 


Phe 


Asn 


Thr 


Phe 


Lys 


Cys 


Val 


Met 


Lys Asn 


Lys 


Pro 


Lys 


1557 










100 










105 








110 




1559 


Phe 


Ser 


Pro 


Val 


Asp 


0, 




















1560 








115 


\* 





















C--> 163 9 (2) INFORMATION FOR SEQ ID NO: 29 

1640 (i) SEQUENCE* CHARACTERISTICS : 

1641 (A) LENGTH: 173 Amino Acids 

1642 (B) TYPE: Amino Acid 

1643 (C) STRANDEDNESS : single 
' . 1644 (D) TOPOLOGY: linear 

164 5 (ii) MOLECULE TYPE: Protein ' 

1646 (iii) HYPOTHETICAL: no 

1647 (iv) ANTI-SENSE: no 
164 8 (vi) ORIGINAL SOURCE: 
1649 (A) ORGANISM: Tenebrio molitor 

C-->1650 (C) INDIVIDUAL ISOLATE: none 

C-->1651 (G) CELL TYPE: fat body and whole organism 

1652 (vii) IMMEDIATE SOURCE: 

1653 (A) LIBRARY: cDNA 

1654 (B) CLONE: 3.4 
C--> 1655 (ix) FEATURE: 

1656 (D) OTHER INFORMATION: Precursor protein with His-tag 

1657 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

1659 Met Gly Ser Ser His His His. His His His Ser Ser Gly Leu Val Pro 

1660 -55 -50 -45 

1662 Arg Gly Ser His Met Ala Ser Met Thr Gly Gly Gin Gin Met Gly Arg 

1663 -40 -35 -30 
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1665 


Glv 


Ser 


Glu 


Phe 


Ala 


Ara 


Ala 


Lys 


Met 


Lys 


Leu 


Leu 


Leu 


Cys 


Phe 


Ala 






1666 




-25 










- 20 










-15 














1668 


Phe 


Ala 


Ala 


lie 


Val 


He 


Glv 


Ala 


Gin 


Ala 


Leu 


Thr 


Asp 


Glu 


Gin 


He 






1669 


-10 










- 5 










1 








5 








1671 


Gin 


Lys 


Arcr 


Asn 


Lys 


He 


Ser 


Lys 


Glu 


Cys 


Gin 


Gin 


Val 


Ser Gly Val 






1672 








10 










15 










20 










1674 


Ser 


Gin 


Glu 


Thr 


lie 


Asp 


Lys 


Val 


Ara 


Thr 


Glv 


Val 


Leu 


Val 


Asp 


Asp 






1675 






25 










30 










35 












1677 


Pro 


Lys 


Met 


Lys 


Lys 


His 


Val 


Leu 


Cys 


Phe 


Ser 


Lys 


Lys 


Thr Gly Val 






1678 




40 










45 










50 














1680 


Ala 


Thr 


Glu 


Ala 


Gly 


Asp 


Thr 


Asn 


Val 


Glu 


Val 


Leu 


Lys 


Ala 


T.vq 


T .fin 
-Lie; u. 






1681 


55 










60 










65 










70 






1683 


Lys 


His 


Val 


Ala 


Ser 


Asp 


Glu 


Glu 


Val 


Asp 


Lys 


He 


Val 


Gin 


Lys 


Cys 






1684 










75 










80 










85 








1686 


Val 


Val 


Lys 


Lys 


Ala 


Thr 


Pro 


Glu 


Glu 


Thr 


Ala 


i y x 


fi en 


Thr 


Phe 


Lys 






1687 








90 










95 
















E- 


- > 


1689 


v ax 


lie 


Tyr 


Asp 


Ser 


Lys 


Pro 


Asp 


Phe 


Ser 


Pro 


Tl p. 

11C 


A en 

asp 








E- 


-> 


1690 






105 










110 










ITS 

X A J 








c- 


- > 


1761 


f 2 ^ 


INFORMATION 


FOR 


SEQ 


ID NO: 


31 
























(i) 


SEQUENCE CHARACTERISTICS: 








\ 












1763 






(A) LENGTH: 149 Amino 


Acids 


















1764 






(B) TYPE: 


Amino Acid 






















1765 






(C) STRANDEDNESS : 


single 




















1766 






(D) TOPOLOGY: 


linear 






















1767 




(ii) 


MOLECULE TYPE: 


Protein 






















1768 




(iii) 


HYPOTHETICAL: no 
























1 769 




(iv) 


ANTI -SENSE 


no 


























1770 




(Vi) 


ORIGINAL SOURCE: 
























1771 






(A) ORGANISM: 


Tenebrio molitor 












Q- 


- > 


1772 






(C) INDIVIDUAL ISOLATE: none 














r- 


- > 


1771 






(G) CELL TYPE: 


fat body and whole 


organism 










1774 


(vii) 


IMMEDIATE SOURCE: 
























1775 






(A) LIBRARY: CDNA 
























1776 






(B) CLONE 


3.4 






















p _ 


_ S 


1777 
J. / / / 




(ix) 


FEATURE : 




























1778 






'(D) OTHER 


INFORMATION 


Mature 


Protein 


with His-tag 






i 77Q 




(xi) 


SEQUENCE DESCRIPTION: SEQ ID NO : 31: 














1781 


Met 


Gly 


Ser 


Ser 


His 


His 


His 


His 


His 


His 


Ser 


Ser 


Gly 


Leu 


Val 


Pro 






1782 










30 










25 










-20 








1784 


Arg 


Gly 


Ser 


His 


Met 


Ala 


Ser 


Met 


Thr 


Gly 


Gly 


Gin 


Gin 


Met 


Gly 


Arg 






1785 








-15 










-10 










-5 










1787 


Gly 


Ser 


Leu 


Thr 


Asp 


Glu 


Gin 


He 


Gin 


Lys 


Arg 


Asn 


Lys 


He 


Ser 


Lys 






1788 






1 








5 










10 














1790 


Glu 


Cys 


Gin 


Gin 


Val 


Ser Gly Val 


Ser 


Gin 


Glu 


Thr 


He 


Asp 


Lys 


Val 






1791 


15 










20 










25 










30 






1793 


Arg 


Thr 


Gly 


Val 


Leu 


Val 


Asp 


Asp 


Pro 


Lys 


Met 


Lys 


Lys 


His 


Val 


Leu 






1794 










35 










40 










45 








1796 


Cys 


Phe 


Ser 


Lys 


Lys 


Thr Gly Val 


Ala. 


Thr 


Glu 


Ala 


Gly 


Asp 


Thr 


Asn 






1797 








50 










55 










60 
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1799 Val Glu Val Leu Lys Ala Lys Leu Lys His Val Ala Ser Asp Glu Glu 

1800 65 70 75 

1802 Val Asp Lys He Val Gin Lys Cys Val Val Lys Lys Ala Thr Pro Glu 

1803 80 85 90 

1805 Glu Thr Ala Tyr Asp Thr Phe Lys Val He Tyr Asp Ser Lys Pro Asp 

1806 95 10/6 105 110 
E--> 1808 Phe Ser Pro He Asp £f 
E--> 1809 115 ^ 
C--> 1888 (2) INFORMATION FOR SEQ ID NO: 33 

1889 (i) SEQUENCE CHARACTERISTICS: 

1890 (A) LENGTH: 173 Amino Acids 

1891 (B) TYPE: Amino Acid 

1892 (C) STRANDEDNESS : single 

1893 ( D ) TOPOLOGY: linear 

1894 (ii) MOLECULE TYPE: Protein 

1895 (iii) HYPOTHETICAL: no 

1896 (iv) ANTI-SENSE: no 

1897 (vi) ORIGINAL SOURCE: 

1898 (A) ORGANISM: Tenebrio molitor 
C--> 1899 (C) INDIVIDUAL ISOLATE: none 
C--> 1900 (G) CELL TYPE: fat body and whole organism 

1901 (vii) IMMEDIATE SOURCE: 

1902 (A) LIBRARY: cDNA 

1903 (B) CLONE: 3.9 
C--> 1904 (ix) FEATURE: 

1905 (D) OTHER INFORMATION: Precursor Protein with His-tag 

1906 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 





1908 


Met 


Gly 


Ser 


Ser 


His 


His 


His 


His 


His 


His 


Ser 


Ser 


Gly 


Leu 


Val 


Pro 




1909 








-55 










-50 










-45 








1911 


Arg 


Gly 


Ser 


His 


Met 


Ala 


Ser 


Met 


Thr 


Gly 


Gly 


Gin 


Gin 


Met 


Gly 


Arg 




1912 






-40 










L 35 










-30 










1914 


Gly 


Ser 


Glu 


Phe 


Ala 


Arg 


Ala 


Lys 


Met 


Lys 


Leu 


Leu 


Leu 


Cys 


Phe 


Ala 




1915 




-25 










-20 










-15 












1917 


Phe 


Ala 


Ala 


He 


Val 


He 


Gly 


Ala 


Gin 


Ala 


Leu 


Thr 


Asp 


Glu 


Gin 


He 




1918 


-10 










-5 










1 








5 






1920 


Gin 


Lys 


Arg 


Asn 


Lys 


He 


Ser 


Lys 


Glu 


Cys 


Gin 


Gin 


Glu 


Ser 


Gly 


Val 




1921 








10 










15 










20 








1923 


Ser 


Gin 


Glu 


Thr 


He 


Asp 


Lys 


Val 


Arg 


Thr 


Gly 


Val 


Leu 


Val 


Asp 


Asp 




1924 






25 










30 










35 










1926 


Pro 


Lys 


Met 


Lys 


Lys 


His 


Val 


Leu 


Cys 


Phe 


Ser 


Lys 


Arg 


Thr 


Gly 


Val 




1927 




40 










45 










50 












1929 


Ala 


Thr 


Glu 


Ala 


Gly 


Asp 


Thr 


Asn 


Val 


Glu 


Val 


Leu 


Lys 


Ala 


Lys 


Leu 




1930 


55 






60 










65 










70 


E--> 


1932 


Lys 


His Val Ala 


Ser Asp Glu Glu Val Asp Lys lie Val Gin 


Lys Cys 


E--> 


1933 






75 








80 








85 






1935 


Val 


Val 


Lys 


Lys 


Ala 


Thr 


Pro 


Glu 


Glu 


Thr 


Ala 


Tyr 


Asp 


Thr 


Phe 


Lys 


E--> 


1936 








90 










95 
















E--> 


1938 


Cys 


He 


Tyr 


Asp 


Ser 


Lys 


Pro 


Asp 


Phe 


Ser 


Pro 


He 


Asp 


«5 




JO 


E--> 


1939 




105 










110 










115 
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C--> 2009 (2) INFORMATION FOR SEQ ID NO: 35 

2010 (i) SEQUENCE CHARACTERISTICS: 

2011 (A) LENGTH: 149 Amino Acids 

2012 (B) TYPE: Amino Acid 

2013 (C) STRANDEDNESS : single 

2014 (D) TOPOLOGY: linear 

2015 (ii) MOLECULE TYPE: Protein 

2016 (iii) HYPOTHETICAL: no 

2017 (iv) ANTI-SENSE: no 

2018 (vi) ORIGINAL SOURCE: 

2019 (A) ORGANISM: Tenebrio molitor 
C--> 2020 (C) INDIVIDUAL ISOLATE: none 

C-->2021 (G) CELL TYPE: fat body and whole organism 

2022 (vii) IMMEDIATE SOURCE: 

2023 (A) LIBRARY: CDNA 

2024 (B) CLONE: 3.9 
C--> 2025 (ix) FEATURE: 

2026 (D) OTHER INFORMATION: Mature Protein with His-tag 

2027 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
2029 
2030 
2032 
2033 
2035 
2036 
2038 

. 2039 

2041 

2042 

2044 

2045 
E--> 2047 
E--> 2048 

2050 
E--> 2051 

2053 
E--> 2054 
E--> 2056 
E--> 2057 

C--> 2139 (2) INFORMATION FOR' SEQ ID NO: 37 

2140 (i) SEQUENCE CHARACTERISTICS: 

2141 (A) LENGTH: 173 Amino Acids 

2142 (B) TYPE: Amino Acid 

2143 (C) STRANDEDNESS: single 

2144 (D) TOPOLOGY: linear 

2145 (ii) MOLECULE TYPE: Protein 

2146 (iii) HYPOTHETICAL: no 

2147 (iv) ANTI-SENSE: no 

2148 (vi) ORIGINAL SOURCE: 



Met 


Gly 


Ser 


Ser 


His 
■30 


His 


His 


His 


His 


His 
-25 


Ser 


Ser 


Gly 


Leu 


Val 
-20 


Pro 


Arg 


Gly 


Ser 


His 
-15 


Met 


Ala 


Ser 


Met 


Thr 
-10 


Gly 


Gly 


Gin 


Gin 


Met 
-5 


Gly 


Arg 


Gly 


Ser 


Leu 
1 


Thr 


Asp 


Glu 


Gin 
5 


He 


Gin 


Lys 


Arg 


Asn 
10 


Lys 


He 


Ser 


Lys 


Glu 


Cys 


Gin 


Gin 


Glu 


Ser 


Gly 


Val 


Ser 


Gin 


Glu 


Thr 


He 


Asp 


Lys 


Val 


15 










20 










25 










30 


Arg 


Thr 


Gly 


Val 


Leu 
35 


Val 


Asp 


Asp 


Pro 


Lys 
40 


Met 


Lys 


Lys 


His 


■ Val 
45 


Leu 


Cys 


Phe 


Ser 


Lys 


Arg 


Thr 


Gly 


Val 


Ala 


Thr 


Glu 


Ala 


Gly 


Asp 


Thr 


Asn 








50 










55 








60 






Val 


Glu 


Val 


Leu 


Lys 


Ala 


Lys 


Leu 


Lys 


HisVal Ala Ser Asp Glu Glu 






65 










70 










75 








Val 


Asp 
80 


Lys 


He 


Val 


Gin 


Lys 
85 


Cys 


Val 


Val 


Lys 


Lys 
90 


Ala 


Thr 


Pro 


Glu 


Glu 


Thr 


Ala 


Tyr 


Asp 


Thr 


Phe 


Lys 


Cys 


He 


Tyr 


Asp 


Ser 


Lys 


Pro 


Asp 


95 








100 










105 










110 


Phe 


Ser 


Pro 


He 


Asp 
115 


(9 
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2149 


c _ 


- > 


2150 


c - 


_ > 


2151 






2152 






2153 






2154 


C - 


r 


21 55 

& A. -J -t 






2156 






2157 






2159 






Zi J-\J\J 






J. u z. 






2163 






9 1 5 

z X U J 






Z X o o 






9 1 fifi 






9 1 fiQ 






2171 






2172 






2174 












9177 






217 8 






9 1 RD 






9181 

Z X O -L 






9 1 ft ^ 






9 1 ft 4 

Z X O *± 






2186 






9 1 R7 

Z. X o / 




_ s 


9 1 RQ 

4& X O .7 


Hi 


_ > 


2190 




_ s. 


9 9 fin 






9961 

j£ Z VJ X 






9969 
z o z 






996 "3 

Z Z U J 






9964 

Z Z U 4 






ZZD J 






2266 






2267 






2268 






2269 






2270 


c- 


-> 


2271 


c- 


-> 


2272 






2273 






2274 






2275 


c- 


-> 


2276 






2277 



RAW SEQUENCE LISTING DATE: 06/27/2001 
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Output Set: N:\CRF3\06272001\I876796.raw 

(A) ORGANISM: Tenebrio molitor 

(C) INDIVIDUAL ISOLATE: none 

(G) CELL TYPE: fat body and whole organism 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: cDNA 

(B) CLONE: 7.5 
(ix) FEATURE: 

(D) OTHER INFORMATION: Precursor Protein with His- tag 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37 



Met 


Gly 


Ser 


Ser 


His 


His 


His 


His 


His 


His 


Ser 


Ser 


Gly 


Leu 


Val 


Pro 






-55 










-50 










-45 






Arg 


Gly 


Ser 


His 


Met 


Ala 


Ser 


Met 


Thr 


Gly Gly Gin 


Gin 


Met 


Gly Arg 






-40 










-35 










-30 








Gly 


Ser 


Glu 


Phe 


Ala 


Arg 


Ala 


Lys 


Met 


Lys 


Leu 


Leu 


Leu 


Cys 


Phe 


Ala 




-25 










-20 










-15 










Phe 


Ala 


Ala 


He 


Val 


He 


Gly 


Ala 


Gin 


Ala 


Leu 


Thr 


Asp 


Glu 


Gin 


He 


-10 










-5 










1 








5 




Gin 


Lys 


Arg 


Asn 


Lys 


He 


Ser 


Lys 


Glu 


Cys 


Gin 


Gin 


Val 


Ser 


Gly 


Val 








10 










15 










20 






Ser 


Gin 


Glu 


Thr 


He 


Asp 


Lys 


Val 


Arg 


Thr Gly Val 


Leu 


Val 


Asp 


Asp 






25 










30 










35 








Pro 


Lys 


Met 


Lys 


Lys 


His 


Val 


Leu 


Cys 


Phe 


Ser 


Lys 


Lys 


Thr 


Gly Val 




40 










45 










50 










Ala 


Thr 


Glu 


Ala 


Gly 


Asp 


Thr 


Asn 


Val 


Glu 


Val 


Leu 


Lys 


Ala 


Lys 


Leu 


55 










60 










65 










70 


Lys 


His 


Val 


Ala 


Ser 


Asp 


Glu 


Glu 
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(2) 


INFORMATION 


FOR 


SEQ 


ID NO: 


39 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 149 Amino Acids 

(B) TYPE: Amino Acid 

(C) STRANDEDNESS : single /I (T\ 

(D) TOPOLOGY: linear - J \J 
(ii) MOLECULE TYPE: Protein fj ^ 

(iii) HYPOTHETICAL: no If* 
(iv) ANTI-SENSE: no H 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Tenebrio molitor 

(C) INDIVIDUAL ISOLATE: none 

(G) CELL TYPE: fat body and whole organism 
(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: cDNA 

(B) CLONE: 7.5 
(ix) FEATURE: 

(D) OTHER INFORMATION: Mature protein with His -tag 
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2278 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39 

2280 Met Gly Ser Ser His His His His His His Ser Ser Gly Leu Val Pro 

2281 -30 -25 -20 

2283 Arg Gly Ser His Met Ala Ser Met Thr Gly Gly Gin Gin Met Gly Arg 

2284 -15 -10 -5 

2286 Gly Ser Leu Thr Asp Glu Gin lie Gin Lys Arg Asn Lys lie Ser Lys 

2287 1 5 10 

2289 Glu Cys Gin Gin Val Ser Gly Val Ser Gin Glu Thr lie Asp Lys Val 

2290 15 20 25 30 

2292 Arg Thr Gly Val Leu Val Asp Asp Pro Lys Met Lys Lys His Val Leu 

2293 35 40 45 

2295 Cys Phe Ser Lys Lys Thr Gly Val Ala Thr Glu Ala Gly Asp Thr Asn 

2296 50 55 60 

2298 Val Glu Val Leu Lys Ala Lys Leu Lys His Val Ala Ser Asp Glu Glu 

2299 65 70 75 

2301 Val Asp Lys lie Val Gin Lys Cys Val Val Lys Lys Ala Thr Pro Glu 

2302 80 85 90 

2304 Glu Thr Ala Tyr Asp Thr Phe Lys Cys lie Tyr Asp Ser Lys Pro Asp 

2305 95 li/0 105 110 
E--> 2307 Phe Ser Pro lie Asp 
E--> 2308 115 

C--> 2468 (2) INFORMATION FOR SEQ ID NO: 45 

24 6 9 (i) SEQUENCE CHAR^C$ERIS3 

2470 (A) LENGTH : (481/base pairs 

2471 (B) TYPE: nucleic acid 

2472 (C) STRANDEDNESS : double 

2473 (D) TOPOLOGY: linear 

2474 (ii) MOLECULE TYPE: cDNA to mRNA 

2475 (iii) HYPOTHETICAL: no 

2476 (iv) ANTI-SENSE: no 

2477 (vi) ORIGINAL SOURCE: 

2478 (A) ORGANISM: Tenebrio molitor 
C--> 2479 (C) INDIVIDUAL ISOLATE: none 

C-->2480 (G) CELL TYPE: fat body and whole organism 

2481 (vii) IMMEDIATE SOURCE: 

2482 (A) LIBRARY: cDNA 

2483 (B) CLONE: 2.2 

C--> 2484 (ix) FEATURE: C gj^etljuj 

2485 (D) OTHER INFORMATION: (g cTncensu g) of Seq ID #44 with Tm 13.17 

2486 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 45: 

2488 GGCANRNNNN AAR ATG AAR YTN CTC TNN TGY YTN RYN TYC NYC RYY 4 6 

2489 Met Lys Leu Leu Leu Cys Phe Ala Phe Ala Ala 

2490 -15 -10 

2492 NTN NTN RTC RNA GYT CAG GCY CTN ACC GAN GNA CAR ATN NAG AAA 91 

2493 lie Val lie Gly Ala Gin Ala Leu Thr Asp Glu Gin lie Gin Lys 

2494 -5 15 

24 96 NNG AAC AAG ATC AGC AAA RAR TGY CAR NAN GNR NNY GGA GTG TCN 13 6 

2497 Arg Asn Lys lie Ser Lys Glu Cys Gin Gin Val Ser Gly Val Ser 

2498 10 15 20 
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2524 CCN RTT GAT TRA NYNNYYNNNA YTNGNNNRNR NTTYRANAAT AAAGNNNNTN 458 

2525 Pro He Asp * 

2526 115 / r ^ r ^) 
E--> 2528 TNRTNNNRNA AAAAAAAAAA AAAAAA ( 4 84 J 
C--> 2533 (2) INFORMATION FOR SEQ ID NO: 46 y ,/?// 

2534 (i) SEQUENCE CHAR^jfTERISTj 

2535 (A) LENGTH : (48p base pairs 

2536 (B) TYPE: nucleic acid 

2 537 (C) STRANDEDNESS : double 

2538 (D) TOPOLOGY: linear 

2 539 (ii) MOLECULE TYPE: cDNA to mRNA 

2540 (iii) HYPOTHETICAL: no 

2541 <iv, „,,—.«= ^ JUJ^ ^ 

2543 (vi) ORIGINAL SOURCE: 

2544 (A) ORGANISM: Tenebrio molitor 
C-->2545 (C) INDIVIDUAL ISOLATE: none 
C-->2546 (G) CELL TYPE: fat body and whole organism 

254 7 (vii) IMMEDIATE SOURCE: 

2 54 8 (A) LIBRARY: cDNA 

2549 (B) CLONE: 2.2 

C--> 2550 (ix) FEATURE: S 

2551 (D) OTHER INFORMATION: Concensus of Seq ID #45 with B1/B2 

2552 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 46: 

2554 GGCANRNNNN AAR ATG AAR YTN CTC TNN TGY YTN RYN TYY NYC RYY 46 

2555 Met Lys Leu Leu Leu Cys Phe Ala Phe Ala Ala 

2556 -15 -10 

2558 NTN NTN RTC NNA GYT CAG GCY NTN ACY NAN GNA NAN NTN NAG NNA 91 

2559 He Val He Gly Ala Gin Ala Leu Thr Asp Glu Gin He Gin Lys 

2560 -5 15 

2562 NNG NNC NAR AYC AGC RNA RAR TGY NAR NNN GNR NNY GGA GTG TCN 13 6 



9 
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E--> 2594 TNRTNNNRNA AAAAAAAAAA AAAAAA 
C--> 2599 (2) INFORMATION FOR SEQ ID NO: 47 

2600 (i) SEQUENCE CHARACTERISTIC^ 



2601 (A) LENGTH: (48 l^Ease pairs 

2602 (B) TYPE: nucTeic acid 

2603 (C) STRANDEDNESS : double 

2604 (D) TOPOLOGY: linear 

2605 (ii) MOLECULE TYPE: CDNA to mRNA 

2606 (iii) HYPOTHETICAL: no 

2607 (iv) ANTI-SENSE: no 

2608 (vi) ORIGINAL SOURCE: 

2609 (A) ORGANISM: Tenebrio molitor 
C--> 2610 (C) INDIVIDUAL ISOLATE: none 

C-->2611 (G) CELL TYPE: fat body and whole organism 

2612 (vii) IMMEDIATE SOURCE: 

2613 (A) LIBRARY: CDNA 

2614 (B) CLONE: 2.2 * 
C--> 2615 (ix) FEATURE: ' 

2616 (D) OTHER INFORMATION: Concensus of Seq. ID #46 with AFP- 3 

2617 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

2 620 GGCNNRNNNN AAR ATG AAR YTN CTC YNN TGY YTN RYN YYY NYY RYY 4 6 

2621 Met Lys Leu Leu Leu Cys Phe Ala Phe Ala Ala 

2622 -15 -10 

2624 NTN NTN RYC NNR RYY YAN GCY NTN ACY NAN RNA NNN NNN NAG NNR 91 

2625 He Val He Gly Ala Gin Ala Leu Thr Asp Glu Gin He Gin Lys 

2626 -5 15 
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E--> 2660 
C--> 2664 

2665 
2666 
2667 
2668 
2669 
2670 
2671 
2672 
2673 
2674 
2675 
2676 
2677 
2678 
2679 
C--> 2680 



C- 
C- 



NNNNNNNNNA AAAAAAAAAA AAAAAA 

(2) INFORMATION FOR SEQ ID NO: 4 8 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 133 Amino Acids 

(B) TYPE: Amino Acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
MOLECULE TYPE: Protein 
HYPOTHETICAL: no • 
ANTI- SENSE: no 
ORIGINAL SOURCE: 

(A) ORGANISM: Tenebrio molitor 
(C) INDIVIDUAL ISOLATE: none 
(G) CELL TYPE: fat body and whole organism 
IMMEDIATE SOURCE: 

(A) LIBRARY: cDNA 

(B) CL0NE: u 



(ii) 
(iii) 
(iv) 
(vi) 



(vii) 



^484^) 




(IX) 



s 



E-- 
E-- 



E--> 2689 




FEATURE : 

(D) OTHER INFORMATION: (Genrai Concensus of Clones, 
B2 and AFP-3 

(Xi) SEQUENCE DESCRIPTION: SEP ID NO: 48: 



Lys Leu Leu Leu Cys Phe Ala Phe Ala Ala He Val He Gly 
Cys Leu He Ser Leu He Leu Leu Val Thr 

Thr Leu Val Ala Ala 

Val 
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VERIFICATION SUMMARY 
PATENT APPLICATION: US/09/876,796 

Input Set : A:\SEQLIST.txt 

Output Set: N:\CRF3\06272001\I876796.raw 



DATE: 06/27/2001 
TIME: 15:33:39 



332 
: 342 
:203 
220 
:220 
220 
342 
:203 
:220 
:220 
M;220 
M:220 
M:220 
M: 220 
M: 220 
M:220 
L:356 M:220 
L:388 M:342 
L:389 M:203 
L:392 M:220 



L:151 
L:171 
L:172 
175 
186 
187 
216 
217 
220 
231 
232 
282 
293 
294 
344 
355 



L:403 
L:404 
L:433 
L:434 
L:437 
L:448 



220 
220 
342 
203 
220 
220 



C: 
C: 
C: 
E: 
E: 
E: 
C: 
C: 
C: 
E: 
E: 
C: 
C: 
C: 
C: 
C: 
C: 
C: 
C: 
C: 
E: 
E: 
C: 
C: 
C: 
E: 
E: 
C: 
C: 



(32) Invalid/Missing Amino Acid Numbering 



Invalid Stop Code On Error, STOP CODON 
No. of Seq. differs, LENGTH : Input : 134 Found: 135 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Invalid Stop Code On Error, STOP CODON 



No. of Seq. differs, LENGTH : Input : 116 Found: 117 SEQ: 4 



Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format. 
Invalid Stop Code On Error, STOP CODON 



No. of Seq. differs, LENGTH : Input : 133 Found: 134 SEQ: 7 



Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Invalid Stop Code On Error, STOP CODON 



No. of Seq. differs, LENGTH : Input : 115 Found: 116 SEQ: 8 



Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 



F) 
v) 
A) 
B) 



Keyword misspelled or invalid format, [ 
Keyword misspelled or invalid format, [ 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
No. of Bases conflict, Input: 36 Counted 
M:254 Repeated in SeqNo=2 

L:124 M:204 E: No. of Bases differ, LENGTH : Input : 566 Counted:576 SEQ:2 
L:127 M:220 C: Keyword misspelled or invalid format, [(2) 
L:138 M:220 C: Keyword misspelled or invalid format, [(C) 
L:139 M:220 C: Keyword misspelled or invalid format, [(G) 



L:3 M:220 C: 
L:8 M:220 C: 
L:13 M:220 C 
L:14 M:220 
L:20 M:220 
L:21 M:220 
L: 23 M: 220 
L: 26 M: 220 
L:30 M:220 
L:35 M:220 
L:47 M:220 
L: 48 M: 220 
L: 52 M: 220 
L:60 M:220 
L:71 M:220 
L:72 M:220 
L:80 M:254 



1) GENERAL INFORMATION:] 
iv) CORRESPONDENCE ADDRESS:] 
ZIP: ] 

COMPUTER READABLE FORM : ] 
APPLICATION NUMBER : ] 
FILING DATE: ] 
vii) PRIOR APPLICATION DATA:] 
Viii) ATTORNEY/AGENT INFORMATION:] 
ix) TELECOMMUNICATION INFORMATION:] 
2) INFORMATION FOR SEQ ID NO:] 
C) INDIVIDUAL ISOLATE:] 
G) CELL TYPE: ] 
ix) FEATURE:] 

2) INFORMATION FOR SEQ ID NO:] 
C) INDIVIDUAL ISOLATE:] 
G) CELL TYPE: ] 
46 SEQ:2 



INFORMATION FOR SEQ ID 
INDIVIDUAL ISOLATE:] 
CELL TYPE : ] 
SEQ ID: 3 



NO: ] 



SEQ: 3 

(2) INFORMATION FOR SEQ ID 
(C) INDIVIDUAL ISOLATE : ] 
(G) CELL TYPE: ] 
* 



NO: ] 



(2) INFORMATION FOR SEQ ID NO:] 

(C) INDIVIDUAL ISOLATE: ] 

(G) CELL TYPE: ] 

(2) INFORMATION FOR SEQ ID NO:] 

(C) INDIVIDUAL ISOLATE:] 

(G) CELL TYPE: ] 

(2) INFORMATION FOR SEQ ID NO:] 

(C) INDIVIDUAL ISOLATE:] 

(G) CELL TYPE: ] 
* 



(2) INFORMATION FOR SEQ ID 

(C) INDIVIDUAL ISOLATE:] 

(G) CELL TYPE:] 
* 



NO: ] 



(2) INFORMATION FOR SEQ ID NO:] 
(C) INDIVIDUAL ISOLATE:] 
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VERIFICATION SUMMARY 

PATENT APPLICATION: US/09/876,796 



DATE: 06/27/2001 
TIME: 15:33:39 



L:449 M:220 
L:500 M:220 
L:511 M:220 
L:512 M:220 
L:544 M:342 
L:545 M:203 
L:548 M:220 
L:559 M:220 
L:560 M:220 
L:589 M:342 



L:590 M:203 E 
L:593 M:220 C 
L:604 M:220 
L:605 M:220 
L:656 M:220 
L:667 M:220 



L:668 M:220 
L:691 M:333 



L:692 M:332 E 
M:332 Repeated 
L:700 M:342 E 
L:704 M:220 C 
L:736 M:333 E 
L:737 M:332 E 
3 32 Repeated 



745 M:342 E 

773 M:254 

938 M:342 
939 



E 
E 

M:203 E 
1056 M:342 E 
:1057 M:203 E 
1186 M:342 E 
;1187 M:203 E 
:1304 M:342 E 
1305 M:203 E 
:1376 M:254 E 
M:254 Repeated 
L:1388 M:204 E 
L:1421 M:332 E 
M:342 Repeated 
L:1442 M:203 E 
L:1559 M:342 
L:1560 M:203 
L:1689 M:342 
L:1690 M:203 
L:1750 M:336 W 
L:1754 M:336 W 
L:1758 M:336 W 
L:1808 M:342 E 



[(C) INDIVIDUAL ISOLATE:] 
[(G) CELL TYPE: ] 

INFORMATION FOR SEQ ID NO:] 
INDIVIDUAL ISOLATE:] 



[(2) 
[(C) 



[ (G) CELL TYPE: ] 



Input Set : A:\SEQLIST.txt 

Output Set: N:\CRF3\06272001\I876796.raw 

Keyword misspelled or invalid format, [(G) CELL TYPE:] 

Keyword misspelled or invalid format, [(2) INFORMATION FOR SEQ ID NO:] 

Keyword misspelled or invalid format, [(C) INDIVIDUAL ISOLATE:] 

Keyword misspelled or invalid format, [(G) CELL TYPE:] 

Invalid Stop Code On Error, STOP CODON:* 

No. of Seq. differs, LENGTH : Input : 13 3 Found: 134 SEQ: 10 

Keyword misspelled or invalid format, [(2) INFORMATION FOR SEQ ID NO:] 

Keyword misspelled or invalid format, [(C) INDIVIDUAL ISOLATE:] 

Keyword misspelled or invalid format, [(G) CELL TYPE:] 

Invalid Stop Code On Error, STOP CODON:* 

No. of Seq. differs, LENGTH : Input : 115 Found: 116 SEQ: 11 

Keyword misspelled or invalid format, [(2) INFORMATION FOR SEQ ID NO:] 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Wrong sequence grouping, Amino acids not in groups! 
(32) Invalid/Missing Amino Acid Numbering, SEQ ID: 13 
in SeqNo=13 

Invalid Stop Code On Error, STOP CODON:* 

Keyword misspelled or invalid format, [(2) INFORMATION FOR SEQ ID NO: ] 
Wrong sequence grouping, Amino acids not in groups! 
(32) Invalid/Missing Amino Acid Numbering, SEQ ID: 14 
in SeqNo=14 

Invalid Stop Code On Error, STOP CODON:* 

No. of Bases conflict, Input: 90 Counted: 91 SEQ:15 

Invalid Stop Code On Error, STOP CODON:* 

No. of Seq. differs, LENGTH : Input : 169 Found: 174 SEQ: 17 
Invalid Stop Code On Error, STOP CODON:* 
No. of Seq. differs, LENGTH : Input : 149 Found: 150 SEQ: 19 
Invalid Stop Code On Error, STOP CODON:* 
No. of Seq. differs, LENGTH : Input : 169 Found: 174 SEQ: 21 
Invalid Stop Code On Error, STOP CODON:* 
No. of Seq. differs, LENGTH : Input : 149 Found: 150 SEQ: 23 
No. of Bases conflict, Input: 595 Counted: 591 SEQ: 24 

in SeqNo-24 

No. of Bases differ, LENGTH : Input : 777 Counted: 776 SEQ: 24 
(32) Invalid/Missing Amino Acid Numbering/ SEQ ID: 25 
in SeqNo=25 

No. of Seq. differs, LENGTH : Input : 170 Found: 175 SEQ: 25 

Invalid Stop Code On Error, STOP CODON:* 

No. of Seq. differs, LENGTH : Input : 149 Found: 150 SEQ: 27 

Invalid Stop Code On Error, STOP CODON:* 

No. of Seq. differs, LENGTH : Input : 173 Found: 174 SEQ: 29 

Invalid Amino Acid Number in Coding Region, SEQ ID: 30 

Invalid Amino Acid Number in Coding Region, SEQ ID: 30 

Invalid Amino Acid Number in Coding Region, SEQ ID: 30 

Invalid Stop Code On Error, STOP CODOl^:* 
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VERIFICATION SUMMARY 

PATENT APPLICATION: US/09/876,7 96 



DATE: 06/27/2001 
TIME: 15:33:39 
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L 


2308 


M: 


203 


E 
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Input Set : A:\SEQLIST.txt 

Output Set: N:\CRF3\06272001\I876796.raw 

No. of Seq. differs, LENGTH : Input : 149 Found: 150 SEQ:31 
Wrong sequence grouping/ Amino acids not in groups! 
(32) Invalid/Missing Amino Acid Numbering, SEQ ID: 33 

in SeqNo=33 
Invalid Stop Code On Error, STOP CODON:* 
Wrong sequence grouping, Amino acids not in groups! 
(32) Invalid/Missing Amino Acid Numbering, SEQ ID: 35 

in SeqNo=35 
Invalid Stop Code On Error, STOP CODON:* 
Invalid Amino Acid Number in Coding Region, SEQ ID: 36 
Invalid Amino Acid Number in Coding Region, SEQ ID: 36 
Invalid Amino Acid Number in Coding Region, SEQ ID: 36 
Invalid Amino Acid Number in Coding Region, SEQ ID: 36 
Invalid Amino Acid Number in Coding Region, SEQ ID: 36 
Invalid Amino Acid Number in Coding Region, SEQ ID: 36 
Invalid Amino Acid Number in Coding Region, SEQ ID: 36 
Invalid Amino Acid Number in Coding Region, SEQ ID: 36 
Invalid Stop Code On Error, STOP CODON:* 
No. of Seq. differs, LENGTH : Input : 173 Found: 174 SEQ: 37 
Invalid Stop Code On Error, STOP CODON:* 
No. of Seq. differs, LENGTH : Input : 149 Found: 150 SEQ: 39 
No. of Bases differ, LENGTH : Input : 481 Counted:484 SEQ.-45 
No. of Bases differ, LENGTH : Input : 481 Counted:484 SEQ:46 
No. of Bases differ, LENGTH : Input : 481 Counted:484 SEQ:47 
(32) Invalid/Missing Amino Acid Numbering, SEQ ID:0 

in SeqNo=4 8 
(2) Invalid Amino Acid Designator, 1 
Invalid Stop Code On Error, STOP CODON:* 
(2) Invalid Amino Acid Designator, 2 

in SeqNo=4 8 

Wrong sequence grouping, Amino acids not in groups! 
(2) Invalid Amino Acid Designator, 2 

No. of Seq. differs, LENGTH : Input : 133 Found: 320 SEQ: 48 
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